CDS

Accession Number TCMCG021C19970
gbkey CDS
Protein Id XP_010931796.1
Location join(15722646..15722750,15726375..15726473,15726717..15726779,15728521..15728598,15730381..15730570,15737252..15737529,15737615..15737990,15742068..15742151,15743849..15744135,15744291..15744347,15748204..15748533,15748615..15748793,15748866..15749148,15751818..15752060,15752176..15752313,15755671..15755811,15755956..15756174,15756411..15756698,15757732..15757938)
Gene LOC105052623
GeneID 105052623
Organism Elaeis guineensis

Protein

Length 1214aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_010933494.2
Definition protein ALWAYS EARLY 2 isoform X3 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category BDT
Description Protein ALWAYS EARLY
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K21773        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04218        [VIEW IN KEGG]
map04218        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCTTCAACAAGGAAATCAAGAAATGTGAACAAGCGCTTCGCAAAAGTGAATGATGAGTGGCTGGATAAAGATGCAACAGTTGTGAACAAAAGTAAAACACGAAAGAGGAAATTGTCTGACATGTTAGGTACTCAATGGAGCAAAGAAGAGCTTGAGCGTTTCTATGAAGCCTATCGAAAATATGGGAAGGACTGGAGAAAGGTGGCTGGTGCAGTACGTAATAGATCGTCAGAAATGGTGGAAGCTCTGTACAATATGAGTAGGGCATACTTATCCCTTCCAGAAGGAACAGCTACTGCAGCTGGCTTGATCGCAATGATGACTGATCACTACAATATTCTGGAAGGGAGTGACAGCGGTCGTGAAAGCAATGATGTGTCAAGAACATCTCGCAAGACTCAAAAGCGTGGCCGGGGAAAATTTCGACTCATGTCGAAAAGTTCCGATGACCAGTATCCTGATCTATTGCAGTATCAGTCAGGTCCAACAGCCTCAGGATGCCTGTCATTATTAAAAAAGAAGCGATCTGGAGATCTATTTCCAGGTAGCAGACCTCGGGCTGTTGGAAAAAGGACACCCCGTGTCCCTGTTTCAAATATGTATGGCAGAGATGATAGGGATAAAATACTTTCCCCAAATAAGCAAGCATTGAAATCTGTTTCAAACACTGCTGATGATGAAGGTGCCCATGTGGCAGCATTGGCTTTAGCAGAGGTTTCTCAGAGAGGAGGCTCACCACAGCTTTCTAGAACACCTGGAAGAAGAGCTGATCACATGAGATCTTCTCCTGCTAAGAGTGGTGAGAAAAAGAACGCTGAGTCAGAGATGGACAGTTCAAAGTTAGTTGGCGCTCAAATGGAGGGTGACTGTCATGAAGGTAGTTTAGGAAGTAGAGAAGCTGAGAATGGAGATTTTGCTAGAGATGCTACTCATCTGATAGAAAATGAAGGTGCTGCAGCAGTTGAAACTCGAAGGAAGGTGAAGAAACTTCAGGGAAAGAGAAAAAAAGTACCAGCAGACATGGAAAATGATCAACTTGATGATGACAGGGAAGCATGCAGTGGTACTGAAGAAGGCATCAATATTAGAAAGATTAAAGATGAAATTGACGGAGAGACTACGGATGGTAAAACTGCAAGAGGATCCAAAAGTTCAAGGAAAAGAAGCCGTCAGCTATTTTTTGGAGATGAAAGCTCTGCCCTTGATGCTCTACAGACACTTGCAGATTTGTCTGTAAATATCTTGCTTCCTACCTCTACTGTTGAATCTGAATCATCTTTCCAAGTTAAAGAAGAGAAAAGAAACATCGACACTGCTGAGGAGCCTAATATACCTGAATCAATGTCAACGACTCATGAGAGAGATCAGTCCAAAGTTTCAGTGAAAAAGGAGACAGGGTATTCTACAAGTGTTGGTACTGATGCTGTTACCAGGAAGAGTGCTAAGCGTGCAAAGTGTTTACGTCATGATGCTAATGTCATTTCTGAAGTGAAGCAGCAAACTTGTGCATGCACTAGTGAGACGCAGAAAAAAAAGCGGAAGTCTTTGACTGGAAAAGCTTCAAAAGGTGAATTTAATAGTGATGCTCAGAAATATGAACCACAAAAGATAGAGGTCTCAGCAGAAGAAGGGAAGAGATTGGTTGGTAAGACTAGACGTGTTAGTCACGTTAGTTCATCACCGAAGCAAGGAAAATTGGTTAAACTACAGGAGAACTCTTCTTCAAGTACTGATCTAGTTAGACCAATCACAGATTCAAATGAAACAATTGTACAGGCTTCTACCACTTGTCCTGGTAACTTGCTAACCAAAAGTAAAAACCGCCGCAAAATAGGTCTACAGAAAGCATGGGCATTGAAGGAATTTAAATCCAATGAGAGTGCTGTAGGCGATCGTCCTGATAAGTACTTACATCCTGTCAACAGGGGGGTGGTTGATCTCAAGGAAAAACTTTCTCACTGCTTGTCTTCTCGAATGTTGCGGAGATGGTGTATGTTTGAGTGGTTTTACAGTGCAATAGATTATCCTTGGTTTGCCAAAAGTGAGTTTGTAGAGTACCTAAATCATGTGAGATTGGGCCATGTGCCAAGGCTAACTCGTATTGAGTGGGGCGTGATACGAAGTTCTCTTGGAAAGCCACGTAGGTTGTCAAAACAGTTTTTGCAGGAAGAAAGAGAGAAGCTTGAGCAATATCGTGAATCAGTTAGGAAGCATTATGCTGAACTTCGAGCTGGTGTTAGAGAAGGACTCCCAACAGATCTGGCTCAGCCTTTATCAGTTGGGCAACGTGTTATTGCTTGTCATCCCAAAACAAGAGAAATTCATGATGGAAGCATTCTGACTGTTGACCGGAACCGGTGCAGGGTTCAATTTGATCGGCCTGAATTAGGGGTTGAGCTTGTGATGGACATCGACTGCATGCCACTGAACCCATTGGAAAATATTCCTGAAGCACTTAGAAGACAGAATATTGTTGCGAATAAATTTTGCACGAGCTTCGCAGATACAAAGCTAGAAGACGGATCTAAGGAGTGGAAAATTGGAGGCTCCATGAAGTTTGCTCCAGCTGAGAGCTTGGAGATCACAAATGGGTCTTCTAGTATTGCTTCTTCTAGTTATCCGATGCATACCTTAATGAAGCAGGCAAAGGCTAAAGCTACTGTAAATGAAGTTGCTGTTGCTGCACAACAGGCAATGTACAGTCAACCTTGTACATTGTCACAAATACAAGAACGAGAAGCTGACATAAGAGTCCTTGCAGAGTTGTCACGTGCCCTTGATAAAAAGGAAGCTCTGCTCATGGAACTGAGACACATGAATGAAGAAGTTTCTGGAAAGCAAAGGGATGGTGATGCCATTAAAGATTTGGAGCATTTTAGAAAGCAATATGCTATGGTGCTTGTGCAGCTAAGAGATGCCAACGATCAGGTTGCTTCGGCCTTGCTCTCTTTGAGGCAACGCAACACGTACCATGGGAATTCAACACATGCATGGGTTAGACCCATTGAGAATTCGGGGGGGCCTGCTGGACCTGCAGACTCTTGCAATTCATCAGCTTTTCTCAATCAGGATTCAGGATCTCATGTAACTGAGATTGTTGAAAGTTCAAGGCGGAAAGCAAGAACGGTAGTTGATGCTGCTGTGCAGGCTATGTGTGCTTTGAAAGAAGGAGAAGATGCTTTTGTCAAGATTGGAGAGGCTTTAGATTCTGTAAACAGCCGCATTTCTGGACCTGGTTCTGGCGTACTTGGAGTAAGACGTAATCCTCCTGATCCTGGACATGGCGGTTCAGCATATCAAGATCATACAACATCATGCATGCCTGAGGCAACAGCAAGTCATGCTAGTCCAAAACCCCATCTTTCTTCTGATTCAGAGATCCAACTTCCATCAGATCTTATTTCATCATGTGTTGCTACATTGCTCATGATACAGACCTGCACTGAGAGACAATGCCCACCTGCCGAGATTGCGCAGATTCTTGATTCTGCAGTCGCAAGTCTGCAGCCATGTTGTCCGCAGAACCTTCCAATTTACAGGGAGATAGAGACATTTATGGGCATCATTAAGAACCAAATGTTGGCACTGATACCCACTCCAAGCATCATACCACCTGTAGAGGTTCCCATTGTGCAAAAATGA
Protein:  
MASTRKSRNVNKRFAKVNDEWLDKDATVVNKSKTRKRKLSDMLGTQWSKEELERFYEAYRKYGKDWRKVAGAVRNRSSEMVEALYNMSRAYLSLPEGTATAAGLIAMMTDHYNILEGSDSGRESNDVSRTSRKTQKRGRGKFRLMSKSSDDQYPDLLQYQSGPTASGCLSLLKKKRSGDLFPGSRPRAVGKRTPRVPVSNMYGRDDRDKILSPNKQALKSVSNTADDEGAHVAALALAEVSQRGGSPQLSRTPGRRADHMRSSPAKSGEKKNAESEMDSSKLVGAQMEGDCHEGSLGSREAENGDFARDATHLIENEGAAAVETRRKVKKLQGKRKKVPADMENDQLDDDREACSGTEEGINIRKIKDEIDGETTDGKTARGSKSSRKRSRQLFFGDESSALDALQTLADLSVNILLPTSTVESESSFQVKEEKRNIDTAEEPNIPESMSTTHERDQSKVSVKKETGYSTSVGTDAVTRKSAKRAKCLRHDANVISEVKQQTCACTSETQKKKRKSLTGKASKGEFNSDAQKYEPQKIEVSAEEGKRLVGKTRRVSHVSSSPKQGKLVKLQENSSSSTDLVRPITDSNETIVQASTTCPGNLLTKSKNRRKIGLQKAWALKEFKSNESAVGDRPDKYLHPVNRGVVDLKEKLSHCLSSRMLRRWCMFEWFYSAIDYPWFAKSEFVEYLNHVRLGHVPRLTRIEWGVIRSSLGKPRRLSKQFLQEEREKLEQYRESVRKHYAELRAGVREGLPTDLAQPLSVGQRVIACHPKTREIHDGSILTVDRNRCRVQFDRPELGVELVMDIDCMPLNPLENIPEALRRQNIVANKFCTSFADTKLEDGSKEWKIGGSMKFAPAESLEITNGSSSIASSSYPMHTLMKQAKAKATVNEVAVAAQQAMYSQPCTLSQIQEREADIRVLAELSRALDKKEALLMELRHMNEEVSGKQRDGDAIKDLEHFRKQYAMVLVQLRDANDQVASALLSLRQRNTYHGNSTHAWVRPIENSGGPAGPADSCNSSAFLNQDSGSHVTEIVESSRRKARTVVDAAVQAMCALKEGEDAFVKIGEALDSVNSRISGPGSGVLGVRRNPPDPGHGGSAYQDHTTSCMPEATASHASPKPHLSSDSEIQLPSDLISSCVATLLMIQTCTERQCPPAEIAQILDSAVASLQPCCPQNLPIYREIETFMGIIKNQMLALIPTPSIIPPVEVPIVQK